Tianyi Zhou

Sandboxed Coding Agents are Competitive Omni-modal Task Solvers

May 30, 2026
Via arXiv

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

May 29, 2026
Via arXiv

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

May 28, 2026
Via arXiv

AI, Take the Wheel: What Drives Delegation and Trust in Human-Computer Cooperative Question Answering?

May 27, 2026
Via arXiv

How Language Models Process Negation

May 04, 2026
Via arXiv

Do Synthetic Trajectories Reflect Real Reward Hacking? A Systematic Study on Monitoring In-the-Wild Hacking in Code Generation

Apr 26, 2026
Via arXiv

Superminds Test: Actively Evaluating Collective Intelligence of Agent Society via Probing Agents

Apr 24, 2026
Via arXiv

Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks

Apr 22, 2026
Via arXiv

Convergent Evolution: How Different Language Models Learn Similar Number Representations

Apr 22, 2026
Via arXiv

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

Apr 20, 2026
Via arXiv